CHEMICAL TREES MINIMIZING ENERGY AND HOSOYA INDEX 

CLEMENS HEUBERGER AND STEPHAN G. WAGNER 

Abstract. The energy of a molecular graph is a popular parameter that is defined as the 
sum of the absolute values of a graph's eigenvalues. It is well known that the energy is related 
to the matching polynomial and thus also to the Hosoya index via a certain Coulson integral. 
Trees minimizing the energy under various additional conditions have been determined in the 
past, e.g., trees with a given diameter or trees with a perfect matching. However, it is quite a 
natural problem to minimize the energy of trees with bounded maximum degree — clearly, the 
case of maximum degree 4 (so-called chemical trees) is the most important one. We will show 
»Oi , that the trees with given maximum degree that minimize the energy are the same that have 

■ been shown previously to minimize the Hosoya index and maximize the Merrifield- Simmons 

C^ ' index, thus also proving a conjecture due to Fischermann et al. Finally, we show that the 

minimal energy grows linearly with the size of the trees, with explicitly computable growth 
constants that only depend on the maximum degree. 
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1. Introduction and statement of results 



o 
u 

C^ , The energy is a graph parameter stemming from the Hiickel molecular orbital (HMO) 

approximation for the total vr-electron energy, see OH]. It is defined as the sum of the 
absolute values of all eigenvalues of a graph: if Ai, A2, • • • , A^ denotes the spectrum of a graph 
G (i.e. the spectrum of its adjacency matrix), one has 

> 

EiG) = ^\X, 

in 

p 

Tjij- ■ For more information about the chemical importance of the energy as well as important 

^D I properties, we refer to the book [5j and the survey article [1]. It is known that there are many 

interesting relations between the spectrum and matchings in the case of trees. This is due 
to the fact that the characteristic polynomial (j){T, x) = det(x/ — A{T)), where A{T) denotes 
the adjacency matrix of a tree T and I the identity matrix, can be expressed as [Ij 



y: (1) c^{T,x) = ^{-l)'^miT,k)x--'\ 

fe>0 
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where m{T, k) denotes the number of niatchings of T of cardinaUty k. It follows from this 
representation that the energy can actually be computed by means of the Coulson integral [5] 



(2) 



E{T) = - r x-^\og (^m(r,A;)x2M 



dx. 



This connection to matchings has been used in various instances to determine the extremal 
values of the energy within certain classes of graphs (trees or graphs that are closely related 
to trees, such as unicyclic graphs — see [9j). We mention, for instance, the recent paper of Yan 
and Ye [11], where trees with prescribed diameter maximizing the energy are characterized. 
An earlier example is the article by Zhang and Li [12j, where trees with a perfect matching 
are studied. 

For obvious reasons, it is also a natural problem to study chemical trees (i.e., trees with 
maximum degree < 4) and generally trees with bounded maximum degree. Fischermann et 
al. [2] noticed that the chemical trees minimizing the energy agree with those minimizing 
the Hosoya index (i.e., the total number of matchings, see [8]) for a small number of vertices. 
Indeed, we will show that this is always the case and also holds for arbitrary given maximum 
degree. The resulting trees have also been shown to maximize the Merrifield-Simmons index 
(i.e., the total number of independent vertex subsets, see [lO]) in an earlier paper by the 
authors [7]. We will show that essentially the same method can be used again. In view of the 
representation ([2]), it is sufficient to minimize the polynomial 

M{T,x) = ^m(T,k)x'' 

k 

for all positive values of x. Surprisingly, it turns out that the result of this minimization 
problem does not depend on x. In order to state our result, we use the notion of complete 
d-ary trees: the complete d-ary tree of height h — 1 is denoted by Ch, i.e., Ci is a single vertex 
and Ch has d branches Ch-i, ■ ■ ■ , Ch-i, cf. Figure [H It is convenient to set Cq to be the 
empty graph. 



O 
(a) Ci for all d 






(b) C2 for d : 



(c) C2 for d = 3 



(d) C3 for d ■- 



Figure 1. Complete d-ary trees 

Let Tn^d be the set of all trees with n vertices and maximum degree < d + 1. We define a 
special tree T* ^ as follows (see also |7j ) : 

Definition 1.1. T*^ is the tree with n vertices that can be decomposed as 
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with Bk^i, . . . , Bk4-i G {Ck, Ck+2} for < k < i and either Bi^i = ■■■ = Bi^a = Q-i or 
Be,i = ■ ■■ = Bi^d = Ci or S^^i, . . . , Bi^d G {Q, ^+1,^+2}, where at least two of B^,i, . . . , 
Bi^d equal C^+i. This representation is unique, and one has the "digital expansion" 

(3) (d-l)n + l = ^afcd^ 

fc=0 
where a^ = {d— 1)(1 + {d+l)rk) and < r^ < d— 1 is the number of B^^i that are isomorphic 
to Ck+2 for k < £, and 

• a^ = 1 if i?£_i = • • • = Bi^d = Ci-i, 

• ae = d if Be^i = ■ ■ ■ = Bg^d = Ce, 

• or otherwise ai = d + {d — l)qi + (d^ — l)rf , where g^ > 2 is the number of B^^i that 
are isomorphic to C^+i and ri the number of Bi^i that are isomorphic to C^+2- 

The tree T^ ^ has already been shown to minimize the Hosoya index and maximize the 
Merrifield-Simmons index over %i,d [7j- In the present paper, we will prove the following 
result: 

Proposition 1.2. Let n and d be positive integers and x > 0. Then T*^ is the unique (up 
to isomorphism) tree in T^^d that minimizes M{T,x). 

Note that the Hosoya index is exactly M(T, 1), and so it is a trivial corollary that T*^ 
minimizes the Hosoya index. Our main theorem is another immediate consequence that 
follows from ([2|): 

Theorem 1. Let n and d be positive integers. Then T*^ is the unique (up to isomorphism) 
tree in T^^d that minimizes the energy. 

In our final section, we will study the asymptotic behavior of the minimal energy: 

Theorem 2. The energy ofT*^ is asymptotically 

E{Tl,) = adn + Oilogn), 

where 



( \ 



2 



E ''-M~'i-V+ E ''-'(■=»^-' 



i>i ^ •" ' i>i 



(4) ad = 2^fd{d - T 

\j=0 mod 2 J = l mod 2 

is a constant that only depends on d. 

2. Recursive Formula for Rooted Trees 

Let X > be fixed. In order to derive recursive formulae for M{T, x), we fix a root r of T 
and we define mi{T, k) to be the number of matchings of cardinality k covering the root and 
mo{T, k) to be the number of matchings of cardinality k not covering the root. 

We write Mj{T,x) = J2k''^j('^^^)^'' ^^^ J ^ {Oil}- Obviously, we have M{T,x) = 
Mo{T, x) + Mi(r, x). The ratio 

Mo(T,x) 



(5) r(r,x) 



M{T,x) 
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will be an important auxiliary quantity in our proofs. The following lemma summarizes 
important properties of all these quantities. 

Lemma 2.1. Let Ti, . . . , T^ be the branches of the rooted tree T . Then the following recursive 
formulcE hold: 



(6) M^{T,x) =J{M{Ti,x), 



i=l 



(7) Mi(r,x)=x^Mo(ri,x)J];M(r„x), 



j=l 3=1 

(8) r{T,x)= / 

Proof. A matching not covering the root corresponds to a selection of an arbitrary matching 
in each branch. If the matching of T is required to have cardinality k, the cardinalities of the 
corresponding matchings in the branches must add up to k. This is exactly the coefficient of 
x^ in the product ni=i M{Ti,x). This proves ^. 

Every matching of T covering the root contains exactly one edge between the root and 
some branch Tj. In this branch, a matching not covering the root of Tj may be chosen, in all 
other branches, arbitrary matchings are allowed. The cardinality of the matching of T is then 
the sum of the cardinalities of the matchings in the branches plus one for the edge incident 
to the root. This yields ([7|). 

Finally, ([8]) is an immediate consequence of ([5]), ([6]) and ([7]). D 

In the following, we will fix a positive integer d and consider only trees whose maximum 
degree is < d+ 1. First of all, we study the behavior of the sequence T^Ch, x). It is convenient 
to set Mq{Cq,x) = and Mi{Co,x) = 1 for the polynomials associated to the empty tree. 
Note that this choice allows adding empty branches without disturbing the recursive formulae 
(0); 0) (©• Then ([H]) translates into a recursion for T{Ch, k) as follows: 

r(Co,a;) = 0, r(Ci,x) = l, r(C/j, x) = — — - — — -. 

1 + dxT[Lh-i,x) 

It is an easy exercise to prove the following explicit formula for r^Ch, x) by means of induction: 
Lemma 2.2. For every x > 0, we have 



f l+Vl+4:dx\^ _ f l-^l+Adx 



V 2 y V 2 



l+v^I+4d^A^+^ fl-Vl+M^'^^+^' 



r{Ch,x) 



Now, the limit behavior of T{Ch,x) for positive x follows immediately. 

Lemma 2.3. For every x > 0, the subsequence t(C2/i,x) is strictly increasing, whereas the 
subsequence t(C2/i+i,x) is strictly decreasing. Both subsequences are converging to the same 
limit - — ,^ , , , and we have 

l+\/l+4dx 

2 

= t{Co,x) <TiC2,x) <■■■ < 7=== < ••• <t{C3,x) <t{Ci,x) = 1. 

1 + V 1 + 4ax 



CHEMICAL TREES MINIMIZING ENERGY AND HOSOYA INDEX 5 

From this point on, the method of [^, in particular Section 5, can be apphed. It suffices 
to replace d by dx at certain points of the proof. For the sake of completeness, we state the 
relevant auxiliary results. 

Lemma 2.4. Let T he a rooted tree and x > 0. Then 

1 



<r(T,x)<l, 



dx + l 
unless T is empty, where t(T, x) = 0. 

Proof. To prove this lemma, use induction and Lemma l2.ll D 

Definition 2.5. Let T be a possibly rooted tree. Then we construct the outline graph of T 
by replacing all maximal subtrees isomorphic to some Cfc, A: > 0, by a special leaf Ck- In this 
process, we attach (d + l — r) leaves Co to internal nodes (non-leaves and non-root) of degree 
r with 2 < r < d. If T is a rooted tree with a root of degree r (1 < r < d), then we also 
attach d — r leaves Co to it. 

The construction ensures that the outline graph of a rooted tree is a rooted d-ary tree, and 
that the outline graph of an arbitrary tree of maximum degree < d + 1 has only vertices of 
degree 1 and d+l. An example is shown in Figure [2j The outline of a rooted tree C^ is just 
the rooted tree consisting of the single leaf C^. 





Figure 2. Reduction to the outline graph {d = 2) 

If enough information on the outline of a rooted tree is available, we can determine it from 
its t(T, x)-value. 

Lemma 2.6. Let j > be an integer and T be a rooted tree whose outline does not contain 
any Ck for < /c < j — 3 and x > 0. If 

j is odd and t(Cj,x) < t(T,x) 

or 

j is even and t(T,x) < t{Cj,x), 

thenT e {Cj_2,Cj}. 

Proof. The inductive proof is analogous to that of [TJ Lemma 3.4], the only difference being 
the fact that here, sums are considered instead of products. D 

3. Minimal Trees with Respect to x 

We say that T is a minimal tree with respect to some x > 0, if it minimizes AI{T, x) among 
all trees in 7^,^. 

The key lemma is an exchange lemma which gives a local optimality criterion. 
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Lemma 3.1. Let x > and let T he a minimal tree with respect to x. If there are (possibly 
empty) rooted trees Li, ... L^, Ri, . . . , R^ and a tree Tq such that T can he decomposed as 




and such that t(Li, x) < t{Ri,x) (after appropriate reordering of the Li 's and the Ri 's), then 
max{T(Lj,x) : 1 < i < d} < mm{r(i?i,x) : 1 < z < d}. 

Proof. We need four auxiliary quantities: 

• moo(^O)^): number of matchings of Tq of cardinality k where neither v nor w is 
covered. 

• mio(To, k): number of matchings of Tq of cardinality k where v is covered, but w is 
not. 

• moi(^O) k): number of matchings of Tq of cardinality k where w is covered, but v is 
not. 

• mil (^0 ) k) '■ number of matchings of Tq of cardinality k where both v and w are covered. 
The corresponding polynomials are denoted by Mij(TQ,x) = ^^?tt,jj(To, A;)x . Define 

GiLi,...,Ld,Ru...,Rd;x) := MooiTo,x)(l + x^TiLi,x)j (l + x^T{Ri,x)j 

+ Mio{To,x)(l + xY,r{R^,xyj + Moi(ro, x) M + x^r(Li, x) J +Mii(ro,x). 
^ 1=1 ^ ^ 1=1 ^ 

Then it is easily seen that 

d d 

M{T,x) = G{Li, . . . ,Ld,Ri, . . . ,Rd;x)'[[M{L„x)llM{Ri,x). 

i=l 4=1 

In view of the minimality of M{T, x), we must have 

G(Li, ...,Ld,Ri,...,Rd;x)< G(7r(Li), . . . , 7r(Lrf), tt{Ri), . . . , 7T{Rd); x) 

for all permutations tt of {Li, . . . ,Ld,Ri, ■ ■ ■ ,Rd}- Ignoring the assumption t(Li,x) < 
t{Ri,x) for the moment, we see that the minimum of the first summand among all pos- 
sible permutations is attained if 

(9) max{r(Lj, x) : i = 1, . . . ,d} < min{T(i?j, x) : i = 1, . . . ,d} or 

min{r(Lj, x) : i = 1, . . . , d} > max{r(i?i, x) : i = 1, . . . ,d} 
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by standard arguments (note that the sum of the two factors does not depend on the permu- 
tation). The sum of the second and the third summand is minimized if max{r(Lj,x) : 
i = l,...,d} < min{r(i?j,x) : i = l,...,d} in the case Miq{To,x) < Mqi{To,x) and 
is minimized if min{r(-Li,rc) : i = l,...,d} > max{r(i?j,x) : i = l,...,d} in the case 
that Mio(ro,x) > Moi(ro,a;). Therefore, the minimahty of G yields Q. The assumption 
t{Li,x) < t{Ri,x) imphes the first possibihty. D 

The remaining steps of the proof in [7J do not depend on the particular recursions any 
more, they only depend on the monotonicity properties in Lemma [2.3l and Lemma [2.61 as well 
as the local optimality criterion in Lemma 13.11 The only difference is that all inequalities 
have to reversed (and p{T) has to be replaced by t{T,x)). Thus the basis of the induction 
can be formulated as follows. 

Lemma 3.2. Let T be a minimal tree with respect to x > and let j be the least nonnegative 
integer such that the outline graph of T contains a Cj . Then the outline graph of T contains 
Cj at most {d — 1) times and there is a vertex v of the outline graph of T which is adjacent 
to all copies of Cj in the outline graph ofT. 



Proof. Analogous to the proof of [7, Lemma 4.3]. 
The inductive step can be formulated as follows. 



D 



Lemma 3.3. Let x > 0, T be a minimal tree with respect to x, k be a nonnegative integer 
and assume that the outline graph of T can be decomposed as 





for some rooted trees Lj. (possibly empty) and R^ with 

k is even and T{Ck,x) < T{Lk,x) < r(Cfc+2,x) 

or 

k is odd and T{Ck+2,x) < T{Lk,x) < T{Ck,x) 

or 

Lk = Ck- 

Assume that R^ is non-empty and the outline of R^ does not contain any Ci with i < k. 
Then exactly one of the following assertions is true: 

(1) Rk G {Ck,Ck+i,Ck+3}, 

(2) Rk consists of d branches Ck+i, Ck+i, Ci^, . . . , Ci^ with ti G {/c, k + l,k + 2} for 
3<i<d, 

(3) the outline of Rk can be decomposed as 
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for Bk^i, . . . , Bk^d-1 £ {Ck,Ck+2} o-nd a non-empty rooted tree Rk+i whose outline 
does not contain any Ci for i < k. Furthermore, 

k is even and T(Cfc+3,x) < t{Li:^i,x) < T{Ck+i,x) 

or 

k is odd and r(Cfc+i,x) < r(Lfc_|_i,x) < r(Cfc+3,x) 
where L^+i is defined as follows: 




Proof. Analogous to the proof of [71 Lemma 4.4]. 



D 



Repeated application of Lemma 13.31 now yields Proposition 11.21 and thus also our main 
theorem. 



Proof of Proposition \1.SX Analogous to the proof of [71 Theorem 1] . 

4. The value of the minimal energy 



D 



Since the extremal trees are described in terms of complete d-ary trees, we have to study 
the energy of these trees first. Note that, in view of ^ and the definition of t{M, x), we have 

M{Ch,x) = -^-.M{Ch-i,xY. 



T{Ch,x) 



Iterating this equation yields 



,x 



dP--i 



and in view of Lemma 12.21 this gives us the explicit formula 



where 



Q,{x) : 



u{x) :- 
v{x) :- 



u{xY — v{xy 

u{x) — v{x) 
l + ^/TTMx 



1 - VI + 4dx 



The denominator u{x) — v{x) has been introduced such that Qj{x) is always a polynomial: 
Indeed, the recursion 

Qi{x) = 1, Q2ix) = 1, Qj{x) = Qj-i{x) + dxQj-2{x) 

holds, and it follows by induction that Qj is a polynomial of degree [(j — l)/2j. 
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Now we have 



h 



h 

3=1 



i=2 i=2 



-df^-^ 



where the fact that Qi{x) = 1 has been used. It turns out that the zeros of Qj can be 
exphcitly computed. If Qj{x) = 0, then u{xy = v{xy , and so 

u{x) 1 + VTTMx 



v{x) 1 - ^/TTMx 
has to be an j-th root of unity (^. Then, 

C 1 



x 



d(l + C)2 2d(l + Re(C))' 



Thus, X has to be of the form 



X = —- 



2d 



(l + cos Y 



for some < k < ^. However, note that x = — ^ is also a zero of the denominator u{x) — 
v{x) = \/l + Adx, and that there are no double zeros, since the derivative is given by 

^ {u{xy - v{xy) = ^== Hxy-' + v{xy-') , 

which cannot be if u{x) = Cv{x) for a j'-th root of unity (" 7^ — 1. Hence, the zeros of Qj are 
precisely the numbers 



1 



2d{l + cos ^) 



, k = l,2,. 



J-1 



and it follows from ([T]) that the characteristic polynomial (l){Ch,x) can be written as 



^{Ch,x) = x^M{Ch,-x~^) = x^Qh+i{-x-')llQ,{- 



■X 



-2\{d-l)d'^-J 



Hence, the nonzero eigenvalues oi (f){Ch,x) are 



±x 2d\ 1 + cos- 



2k-K 



k'K 



±2\/dcos — , A; = l,2,..., 
J 



J-1 



with multiplicity [d — l)d ^ for j = 1,2, ... ,h and multiplicity 1 for j = h + 1. If follows 
that the energy of Ch is given by 

/ h L(i-i)/2j , \ LV2J , 

E{Ch)=i^id-l)d^'^ Y^ 4^/dcos— +^4ydcos-^. 

\i=l k=l -^ J k=l 



10 

Noticing that 
this reduces to 
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L(^,^/2J ^^ |i(eoti^-l) i^O mod 2 



fc=i 



CSC 1^ — 1 ) j ^ 1 mod 2, 



E{Ch) = 2Vd{d - 1) 



V..' 



.3 = 1 \ J / - = 1 



\ 



mod 2 



vr 



'^-i / CSC — - 1 
2j 



j=l mod 2 



2Vd( 



CSC 



2\/d f cot 



2(/i+l) 



Ij h = mod 2, 
1 1 h=l mod 2. 



2(fe+l) 

Next, we determine the asymptotic behavior of the energy of Ch'- 
Lemma 4.1. The energy of a complete d-ary tree Ch satisfies 

E{Ch)=ad\Ch\+0{l), 
where \Ch\ denotes the number of vertices of Ch and ad is given by (j3]). 

Proof Note that \Ch\ = ^ = ^ + 0(1) and cot ^^ = ^ + 0(1), esc ^ = ^ + 0(1), so 
that 

E{Ch) = 2Vd{d - l)d^ 
AVdh 



j=0 mod 2 jf=l mod 2 



V 



vr 



+ o(i) 



Qd|0/,| - 2\/d(d - l)(i'* 
A^/dh 



E -(-§-)- E -(»4-0 



\j=0 mod 2 



j>h 
j=l mod 2 



vr 



0(1) 



ad\Ch\ - 2Vd{d - l)d'' y d-^ C^ + 0(1)] + ^^ + 0(1) 

^ — ' \ TT J vr 

j>h ^ ^ 



ad 



Ch\ -2Vd{d-l)d'' 



2hd''^ ;,A Ay/dh 



(.-!),+''(''">) +^ + °« 



D 



= ad|0;,| + 0(l), 
as claimed. 

Now, we are able to prove our main asymptotic result: 
Proof of Theorem \^ Using the decomposition of T'* ^ as shown in Definition II. H we note that 
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(e-1 d-i \ f ^ 

k=Oj=l J \j=l 

for arbitrary x > 0, since every matching in the union |J^ IJ • B^j is also a matching in T* ^, 
whereas every matching of T* ^ consists of a matching in |J^ |J ■ B^j and a subset of the 
remaining < d{i + 1) edges. Making use of ^ once again, this imphes that 



e-i d-l d 

k=0 j=l j=l 

-1 d-l d 



X ^logfl + x^) dx. 



< E E ^(^'^.i) + E ^(^^.j) + -'^(^ + 1) / 

fe=oj=i i=i •^'J 

Since Jq°° x"^ log(l + x^) dx = tt, this imphes that 

^-1 d-l d 

E{Tid) = E E ^(^'^.^■) + E ^(^^.^■) + o(^) 

fc=0 j=l j=l 

^-1 d-l d 

= T.Y. i^d\BkA + 0(1)) + E ("-^i^^'^-i + 0(1)) + o{(-) 

k=0 j=l j=l 

(e~i d-l d \ 

EEi^'^.^-i + Ei^^.ii +o(^) 
k=0 j=l j=l J 

= a,{\T:^,\-0{£)) + 0{e) 
= adn + 0{t). 

It is not difficult to see that £ = O(logn) (this follows from ([3]), see [6] for a detailed analysis), 
and so we finally have 

E(r;j = arfn + 0(logn), 
which finishes the proof. D 

The following table shows some numerical values of the constants a^: 
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d 


ad 


2 


1.102947505597 


3 


0.970541979946 


4 


0.874794345784 


5 


0.802215758706 


6 


0.744941364903 


7 


0.698315075830 


8 


0.659425329682 


9 


0.626356806404 


10 


0.597794680849 


20 


0.434553264777 


50 


0.279574397741 


100 


0.198836515295 
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